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(54) Packet distribution in a microcomputer 



(57) An integrated circuit device (11) with an ad- 
dress and data path (15) interconnects a CPU (12) with 
at least one module (14) and a memory interface (32), 
the module (14) having circuitry (6) to generate an event 



request packet and the CPU having event logic (44) to 
decode the packet as well as circuitry to generate ad- 
dressed memory access packets, the same address 
and data path (15) being used for the distribution of 
event request packets and memory access packets. 
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Description 

[0001] The invention relates to nnicrocomputers, computer systems and methods of operating such. 
[0002] Microcomputer chips may include a CPU with a plurality of other modules as well as a memory interface on 
the same chip. The CPU as well as other modules may need to carry out memory access operations in order to effect 
read and write operations. Furthermore, some modules on the chip may need to generate interrupt requests to vary 
to routine operated by another device on the chip and in some cases it may be necessary to provide control commands 
to which an on-chip CPU must respond. 

[CK)03] It Is an object of the present invention to provide an improved computer system and method of operating a 
computer system for distributing memory access requests and other requests on a chip. 

[0004] The invention provides a computer system including an integrated circuit chip with an address and data path 
interconnecting a plurality of on-chip devices including at least one CPU. at least one module and a memory interface, 
(a) said module having packet generating circuitry responsive to an event to generate an event request packet including ^ 
a destination address, (b) said CPU having event logic to decode the packet and identify the request of the packet ^ 
and circuitry to generate addressed memory access packets, and (c) said address and data path being used for dis- 
tribution of both event request packets and memory access packets. 

[0005] Preferably both said module and said CPU each include packet generating circuitry operable to generate both 
event request packets and memory access packets for distribution on the common address and data path. 
[0006] Preferably the packet generating circuitry is responsive to receipt of an event request packet to generate an 
addressed response bit packet for distribution on said address and data path. 

[0007] Preferably the packet generating circuitry of a module includes means to indicate the address of the destination 
for the packet as well as the address of the module acting as a source of the packet. 

[0008] Preferably the packet generating circuitry is responsive to receipt of an event request packet to determine 
from the packet a source address of the packet and to generate a response packet using said source address as the 
destination indicator for the response packet. 

[0009] The system may include an on-chip memory, said memory interface providing connection between said ad- 
dress and data path and said on-chip memory. 

[0010] The system may include an off -chip memory, said chip having an external memory interface connected to 
said off-chip memory and to said address and data path. 

[001 1] The packet generating circuitry of said module may be arranged to generate an event request packet forming 
an interrupt request with a priority indicator and said event logic of the CPU includes comparator circuitry for comparing 
priorities of event request packets received with the priority of any current CPU activity 

[0012] The packet generating circuitry of said module may be arranged to generate an event request packet in the 
form of a control packet for control command to the GPU. 

[0013] Preferably a plurality of modules are provided on chip, each having packet generating circuitry for generating 
event request packets, at least one module being arranged to generate event packets in the form of prbritised interrupt 
requests and at least another module being arranged to generate event request packets in the form of control packets 
for the CPU. 

[0014] Preferably said address and data path includes at least one on-chip bus arranged to distribute said packets 
in bit parallel format. 

[0015] Preferably said integrated circuit chip includes at least one external port for off-chip connection, said port 
including bit fomiat translation circuitry to convert on-chip packets of bit parallel format to a less parallel format tor 
transmission off-chip. 

[0016] The invention also provides a method of operating a computer system comprising an integrated circuit chip 
with an address and data path interconnecting a plurality of on-chip devices including at least one CPU, at least one 
module and a memory interface, which method comprises detecting an event at a module, generating an event request 
packet with a destination address, distributing the request packet on the address and data path to the destination, 
decoding the packet at the destination to identify the nature of the request, said method further including generating 
addressed memory access packets for memory read and write operations, said memory access packets and said event 
request packets being distributed on the same address and data path. 

[0017] Preferably event request packets are generated for distribution on the address and data path to the CPU, 
which event request packets are in the form of prioritised interrupt requests. 

[0018] Preferably event request packets are generated for distribution on said address and data path to the CPU, 
at least some of said event request packets being in the form of control command packets to which the CPU must 
respond on receipt of the packet. 

[0019] An embodiment of the present invention wilt now be described by way of example with reference to the ac- 
companying drawings in which: 
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Figure 1 is a block diagram of a microcomputer chip in accordance with the present invention, 
Figure 2 shows more detail of a debug port of the microcomputer of Figure 1 , 
Figure 3 shows input of a digital signal packet through the port of Figure 2, 
Figure 4 shows the output of a digital signal packet to the port of Figure 2, 
Figure 5 shows accessing of registers in the port of Figure 2. 
Figure 6 shows the format of a digital signal request packet which may be used in the microcomputer of Figure 1 , 
Figure 7 shows the format of a digital signal response packet which may be used in the microconnputer of Figure 1 , 
Figure B shows one example of a serial request packet which may be output or input through the port of Figure 2, 
Figure 9 illustrates further details of one CPU of the microcomputer of Figure 1 including special event logic, 
Figure 10 shows further detail of the special event logic of Figure 9= 

20 

Figure 1 1 shows a microcomputer of the type shown in Figure 1 connected to a host computer for use in debugging 
the CPU by operation of the host, 

Figure 12 shows an arrangement similar to Figure 11 in which a second CPU is provided on the same chip and 
25 operates normally while the other CPU is debugged by the host. 

Figure 13 illustrates one CPU forming part of a microcomputer as shown in Figure 1 when connected to a host 
computer for use in watchpoint debugging, 

30 Figure 14 shows a microcomputer of the type shown in Figure 1 connected to a host computer in which one CPU 

on the microcomputer is debugged by the other CPU on the same chip. 

Figure 15 shows more detail of part of the logic circuitry of Figure 10, 

^5 Figure 1 6 shows more detail of part of the logic circuitry of Figure 15, 

Figure 1 7 shows more detail of another part of the logic circuitry of Figure 1 5, and 

Figure 18 shows a block diagram of three interconnected integrated circuit CPU devices in accordance with the 
40 invention, 

Figures 19 to 24 show different bit packet formats for distribution on the address and data paths of the devices 
shown in Figures 1 and 18, 

^5 Figure 25 shows more detail of event handling circuitry in a CPU of the type shown in Figures 1 and 18, 

Figure 26 shows a priority comparator used in the CPU's of Figures 1 and 18, and 

Figure 27 shows an event logic and packet generator for use in modules connected to the data and address path 
50 of the devices shown in Figures 1 and 18. 

[0020] The integrated circuit devices of this embodiment are illustrated in Figures 1 and 18. Figure 1 shows a single 
chip whereas Figure 18 shows three chips interconnected through external ports 30 by wires lO carrying serial bit 
packets. On each chip 11 a CPU 12 is connected to a plurality of modules 14 by a data and address path 15 arranged 
55 to carry bit packets in parallel form. The modules 14 as well as the CPU 12 include event logic used in the distribution 
of bit packets on the path 15. Three types of packet are used on the data and address path 15, each including a 
destination indicator to indicate the required destination device connected to the path 15. The packets include data 
transfer packets which are necessary for memory access operations. In addition there are event packets of two types. 
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Normal event packets form priorrtised interrupts which may be received the CPU or module with the recipient selectively 
deciding whether, or when, to respond to the event packet depending on relative priority with other activities requested 
at that device. Special event packets form command control signals which must be acted on by the recipient device 
when the special event packet is received. In this embodiment modules 14 as well as the CPU 12 have event logic for 
s handling event packet formation and receipt including normal events acting as interrupt requests as well as special 
events acting as control commands. In the example shown in Figure 1 6 each chip 11 includes on-chip memory as well 
M as off-chip memory 1 20. Although in Figure 1 8 each chip includes a single CPU 1 2 more than one CPU may be provided 

;|j on the same chip as shown in Figure 1 . 

% [0021] The preferred embodiment illustrated in Figure 1 comprises a single integrated circuit chip 11 on which is 

10 provided two CPU circuits 1 2 and 1 3 as well as a plurality of modules 1 4, The CPU's 1 2 and 1 3 as well as each module 
14 are interconnected by a bus network 15 having bi-directional connections to each module. In this example the bus 
network is referred to as a P-link consisting of a parallel data bus 20 as shown in Figure 2 together with a dedicated 
control bus 21 provkJed respectively for each module so as to link the module to a P-link control unit 22. Each module 
is provided with a P-link interlace 23 incorporating a state machine so as to interchange control signals between the 

'5 respective P-link control line 21 and the interface 23 as well as transferring data in two opposing directions between 
" the data bus 20 and the interface 23. 

[0022] In the example shown in Figure 1. the various nnodules 14 include a video display interface 25 having an 
external connection 26, a video decode assist circuitry 27, an audio output interface 28 having an external connection 
; j 29, a debug port 30 having an external connection 31, an extemal memory interface 32 having an external bus con- 

20 nection 33 leading to an external memory, clock circuitry 34, various peripheral interfaces 35 provided with a plurality 
of bus and serial wire output connections 36, a network interface 37 with an external connection 38 as well as the P- 
link control unit 22. The two CPU units 1 2 and 1 3 of this example are generally similar in construction and each includes 
a plurality of instruction execution units 40, a plurality of registers 41 an instruction cache 42 and a data cache 43. In 
this example each CPU also includes event logic circuitry 44 connected to the execution units 40, and the other modules 

2^ connected to the P-link each include event logic 6 for handling both normal event and special event packets. The P- 
link 1 5 is arranged to transmit to modules on the link and to the external memory interface both request and response 
packets, including memory access transactions, interrupts in the form of normal events, and control signals in the form 
of special events. These packets may be generated by software as a resuK of instruction execution by a CPU or by 
hardware responsive to detecting of an event. The packets may be generated on chip and distributed on the link 15 

30 or generated off chip and supplied to the on chip link 1 5 through an external port such as the debug port 30. 

[0023] The CPU's can be operated in conventional manner receiving instructions from the instruction caches 42 on 
chip and effecting data read or write operations with the data cache 43 on chip. Additionally external memory accesses 
for read or write operations may be made through the external memory interface 32 and bus connection 33. The debug 
port 30 is described in more detail in Figures 2 to 5. As shown in Figure 2, this circuitry includes a hard reset controller 

35 45 connected to a hard reset pin 46. The controller 45 is connected to all modules on the chip shown in Figure 1 so 
that when the hard reset signal is asserted on pin 46 all circuitry on the chip is reset. 

[0024] As will be described below, this port 30 provides an important external communication which may be used 
for example in debugging procedures. The on-chip CPU's 1 2 and 1 3 may obtain instruction code (by memory access 
Wi :i packets) for execution from an extemal source communicating through the port 30. Furthermore, event packets pro- 

"to viding either interrupts or control signals may be put onto the P-link 15 from an external chip via the port 30. Comnnu- 
1 nications on the P-link system 15 are carried out in bit parallel format. Transmissions on the data bus 20 of the P-ltnk 

; 15 may be carried out In multiple byte packets, for example 35 bytes for each packet, so that one packet is transmitted 

in five consecutive eight byte transfers along the P-link each transfer being in bit parallel format. The port 30 is arranged 
to reduce the parallelism of packets obtained from the P-Iink 15 so that they are output in bit serial format through the 
45 output 31 or alternatively in a much reduced parallel format relative to that used on the P-link 1 5 so as to reduce the 
number of external connection pins needed to implement the extemal connection 31. 
[0025] The structure of the port 30 will now be described with reference to Figures 2 to 5. 
; [0026] In this example the port 30 comprises an outgoing packetising buffer 50 connected to the P-link interface 23 

i as well as an incoming packetising buffer 51 connected to the interface 23. On the output side, the externa! connection 

50 3-| is in this case formed by an output pin 52 and an input pin 53. The port in this case effects a full transition between 
parallel format from the data bus 20 to bit serial format for the input and output pins 52 and 53. The pins 52 and 53 
are connected as part of an output link engine 55 which also incorporates serialiser 56 and de-serialiser 57 connected 
respectively to the outgoing packetising buffer 50 and the incoming packetising buffer 51 . Between the buffers 50 and 
51 are connected by bidirectional connections a register bank 58 and a port state machine 59. The function of the port 
55 30 is to translate bit packets between the internal on-chip parallel format and the external bit serial format. In addition 
it allows packets which are input through pin 53 to access the registers 58 in the port without use of the P-link system 
15. Equally packets on the P-link system 15 can access the registers 58 of the port without using the external pins 52 
or 53. 
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[0027] The format of the multibit packets used on the P-link system 15 in the microcomputer are illustrated by way 
of example in Figures 6. 7 and 8. Figures 6 and 7 show packet formats used in parallel form on the P-link whereas 
Figure 8 shows a packet similar to that of Figure 6 including a length indication when the packet is in serial form. When 
a packet is to be output from the port 30 from one of the modules 14 connected to the P-link 15. the module transmits 

s the parallel representation of the packet along the data bus 20. The packet may comprise a plurality of eight byte 
transfers as already described. Each module 14, including the port 30, have a similar P-link interface 23 and the op- 
eration to take data from the bus 20 or to put data onto the bus 20 is sinnilar tor each. When a module has a packet to 
send to another module, for example to the port 30, it first signals this by asserting a request signal on line 60 to the 
dedicated link 21 connecting that module to the central control 22. It also outputs an eight bit signal on a destination 

10 bus 61 to indicate to the control the intended destination of the packet ft wishes to transmit. It will be understood that 
the P-link 21 Is itself a bus. A module such as the port 30, which is able to receive a packet from the bus 20 will assert 
a signal "grant receive" on line 62 to be supplied on the dedicated path 21 to the central control 22 regardless of whether 
a packet is available to be fed to that destination or not. When the central control 22 determines that a module wishes 
to send a packet to a destination and independently the destinatbn has indicated by the signal on line 22 that it is able 
to receive a packet from the bus 20, the control 22 arranges for the transfer to take place. The control 22 asserts the 
"grant send' signal 63 via the dedicated line 21 to the appropriate interface 23 causing the sending module to put the 
packet onto the P-link data path 20 via the bus 64 interconnecting the interface 23 with the data bus 20. The control 
22 then asserts the "send" signal 65 of the receiver which signals to it that it should accept the transfers currently on 
the P-link data bus 20. The packet transmission concludes when the sender asserts its -"end of packet send" line 66 

20 concurrently with the last transfer of packet data on the bus 20. This signal is fed on the dedicated path 2 1 to the central 
control 22 and the control then asserts the "end of packet received" signal 67 to the receiving module which causes it 
to cease accepting data on the P-link data bus 20 after the current transfer has been received. 
[0028] The parallel to serial translation which takes place in the port 30 has a one to one equivalence between the 
parallel and serial packets so that all data contained in one packet form is contained in the other The translation 

2S therefore involves identifying the type of the packet and copying across fields of the packet in a manner determined 
by the type. When a packet is input to the outgoing packetising buffer 50 from the data bus 20, the packet is held in 
its entirety as the buffer is 35 bytes long in order to hold the longest packet. As shown in Figure 4, buffer 50 is connected 
to the port state machine 59 and to a shift register 70 by a transfer bus 71. The shift register 70 is connected to the 
serialiser 56. The state machine 59 provides input signals 72 to the buffer 50 to copy specific bytes from the P-link 

30 packet onto the transfer bus 71 under the control of the state machine 59. Firstly the most significant byte of the packet, 
which holds the destination header 73. is placed onto the byte wide transfer bus 71 . The state machine 59 compares 
this value with those values which indicate that the packet is destined for the shift register and output serial link. If the 
packet is destined for the output serial link, the state machine causes the next byte 74 of the packet (which is the 
operation code indicating the type of packet) to be placed on the transfer bus 71 . From the opcode 74 which is supplied 

55 to the state machine 59 on the transfer bus 71, the state machine determines the length and format of the packet 
derived from the data bus 20 and therefore determines the length and format of the serial packet which it has to 
synthesise. The state machine 59 outputs a byte which indicates the serial length packet onto the transfer bus 71 and 
this is shifted into the first byte position of the shift register 70. The state machine 59 then causes bytes to be copied 
from the buffer 50 onto the bus 71 where they are shifted into the next byte position in the shift register 70. This continues 

40 until all the bytes from the buffer 50 have been copied across. The order of byte extractions from the buffer 50 is 
contained in the state machine 59 as this determines the refonnalting in serial format. The serial packet may then be 
output by the output engine 55 via pin 52 to externally connected circuitry as will be described with reference to Figures 
11 to 14, 

[0029] When a serial packet is input through pin 53 to the port 30. the translation is dealt with as follows. Each byte 
^5 is passed into the shift register 80 forming a packetising buffer. Such a serial packet is shown in Figure 8 in which the 
first byte 81 indicates the packet size. This will identify the position of the last byte of the packet. Referring to Figure 
3, the register 80 copies bytes in the simple order they are shifted out of the shift register onto a transfer bus 83 under 
the control of the state machine 59. The state machine 59 compares the destination byte 84 of the packet with those 
values which indicate that the packet is destined for the P-link system 1 5. The state machine 59 causes the next byte 
50 85 of the packet to be placed on the transfer bus in order to indicate the type of packet (also known as the opcode) 
and from this the state machine checks the length and format of the serial link packet and those of the P-link packet 
which it has to synthesise. The state machine 59 causes bytes to be shifted out of the register 80 onto bus 83 where 
they are copied into a P-link packet buffer 51. This continues until all serial link bytes have been copied across and 
the positions in which the bytes are copied into the buffer 66 from the shift register 80 is determined by setting of the 
55 state machine 59, This indicates to the interface 23 that a packet is ready to be put on the bus 20 and the interface 
communicates through the dedicated communication path 21 with the central control 22 as previously described. When 
the P-link system 15 is ready to accept the packet the interface responds by copying the first eight bytes of the packet 
onto he data path 20 on the following clock cycle (controlled by clock 34). It copies consecutive eight byte parts of the 
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packet onto the bus 20 on subsequent clock cycles until alt packet bytes have been transnnitted. The final eight bytes 
are concurrent with the end of packet sent signal being asserted by the interface on line 66. 

[0030] As already described, an incoming packet (either parallel or serial) to the port 23 may wish to access port 
registers 58. When the destination byte 84 of an incoming serial bit packet from the pin 53 indicates that the packet is 

5 destined to access registers 58, the bit serial packet is changed to a P-link packet in buffer 51 as already described 
but rather than being tonwarded to the P^ink interface 23, it is used to access the register bank 58. One byte (the 
opcode) of the packet will indicate whether the register access is a read or write access. If the access is a read, then 
the state machine 59 will ogtput a read signal on line 90 shown in Figure 5. Concurrent with this the least significant 
four bits of the packet address field are placed on lines 91. Some cycles later the register bank 58 under control of a 

10 control block 92 will copy the value in the addressed register onto the data bus 93 one byte at a time, each byte on a 
successive clock cycle. Each byte on the data line 93 is latched into the outgoing buffer 50 and under control of the 
state machine 59, the data read from the register is synthesised into a P-link packet in buffer 50 and specified as a 
'toad response". The destination field for this response packet is copied from a "source" field of a requesting bit serial 
packet. A transaction identifier (TID) whk;h is also provided in each packet, is also copied across. A type byte of the 

75 response packet is formed from the type byte of the request packet and consequently a response P-link packet is 
formed in the outgoing buffer 50 in response to a request packet which was input from an external source to pin 53. 
[0031] If the type of access for registers 58 is a write access then the write line 95 Is asserted by the state machine 
59 together with the address line 91 . Some cycles later the least significant byte of the data is copied from an operand 
field of the packet in buffer 51 onto the data bus 93. On the following seven cycles bytes of successive significance 

20 are copied to the registers 58 until all eight bytes have been copied. A response packet is then synthesised in register 
50 except that "store response" packets do not have data associated with them and comprise only a destination byte, 
a type byte and a transaction identifier byte. This response packet is translated into a bit serial response packet as 
previously described, loaded into shift register 70 and output through pin 52 to indicate to the source of the write request 
that a store has been effected. 

2S [0032] Similarly if the destination byte of a packet received from the P-link system 1 5 by the port 30 is examined and 
indicates that the packet is destined to access registers 58 in the port 30, a similar operation is carried out. Rather than 
being forwarded to the bit serial register 70. the type of field of the packet is used to determine whether the access is 
a read or write access, if the access is a read then Ihe read line 90 of Figure 5 is asserted by the state machine 59 
and the least significant four bits of the packets address field are placed on the address line 91. Two cycles later the 

30 register bank copies the value held in the register which has been addressed onto the data line 93 one byte at a time 
each on successive cycles. This is latched into buffer 51 and the state nnachine synthesises a P-link packet which is 
specified as a "read response" packet. The destination field for this response packet is copied from the source field of 
the requesting bit serial packet. The transaction identifier is also copied across. The type byte of the response packet 
is formed from the type byte of the request packet. 

35 [0033] If the type of access required is a write access then state nnachine 59 asserts the write line 95 together with 
the address line 91. Some cycles later the least significant byte of the data is copied from the operand field of the 
packet in buffer 50 to the data line 93. On the following seven cycles bytes of successive significance are copied to 
the data lines 93 and copied into the registers until all bytes have been copied. A response packet is then synthesised 
as previously described except that "store response" packets do not have data associated with them and comprise 

40 only a destination byte, a type byte and a transaction identifier byte. This response packet is then forwarded to the P- 
link interface 23 where it is returned to the issuer of the request packet which have been input through the P-link 
interface 93 In order to access the port registers 58. 

[0034] From the above description it will be understood that the packet formats shown in Figures 6, 7 and 8 include 
packets that form a request or a response to a read or write operation. In addition to each packet including a destination 

4S indicator for the packet (numeral 73 in Figures 6 and 7 or numeral 84 in Figure 8) the packets include a (TIO) transaction 
identifier 98 and an indication of the source 99. The packets may need to identify a more specific address at a desti- 
nation. For this reason an address Indicator 100 may be provided. As already described in relation to register access 
at the port 30, the destinatbn identifies the port although the address 1 00 is used to indicate the specific register within 
the port. The Destination field is a one byte field used to route the packet to the target subsystem or module connected 

50 to the P link 1 5. For request packets it is the most significant byte of the address to be accessed. For a response packet 
it identifies the subsystem which issued the request The source field is a one byte field which is used as a return 
address for a response packet. The Address field is provided by the least significant 3 bytes of the request address 
The TID field is used by the requester to associate responses with requests. The TID enables a module to identify 
response packets corresponding to respective request packets in cases where a plurality of request packets have been 

^5 sent before response packets have been received for each request packet. 

[0035] It will be appreciated that by using a bit serial port low cost access is provided to a chip, requiring only a small 

number of pins for access, and may be particularly used for debugging a CPU by use of an external host. 

[0036] In this example each CPU 12 and 13 is arranged to execute an instruction sequence in conventional manner 
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The instruction set will include a plurality of conventional Instructions for a microcomputer, such as load and store used 
lor memory accesses, but this example also includes an instruction to send an "event" packet. An "event" is an excep- 
tional occurrence normally caused by circumstances external to a thread of instructions. An event packet may be sent 
by execution of an event instruction although hardware In the form of the event logic in any module connected to the 
P-iink may generate some events and event packets without the execution of instructions in a service or handler routine. 
[0037] Events which originate from execution of an instruction by a CPU are caused by execution of the event in- 
struction. This can be used to send an "event" to a CPU such as one or other of the CPU's 1 2 or 1 3 on the same chip 
or it may be used to send an event to a CPU on a different chip through an external connectran. The CPU which 
executes the event instruction may also send an event to a further module connected to the P-link system 15. The 
event instruction has two 64 bit operands, the event number and the event operand. With regard to the event number 
0-63, bit 1 5 is used to determine whether or not the event is a "special event". A special event is used in a control 
signal packet to which a recipient module or CPU must respond regardless of the priority level at which the CPU is 
currently operating. When bit 15 is set to 1, bits 0-14 are used to define the type of special event. Bits 16-63 of the 
event number are used to identify the destination address of the CPU or module to receive the special event. The bit 
numbers referred to above in the event number may be mapped to different locations in the packet as shown in Figure 
6. For example, the opcode Identifying the packet as an event request will be located at byte position marked 74 in 
Figure 6 but the bits determining the type of event (EN code) will be positioned in the address section 100 as this is 
not needed for extra address indication in the case of an event packet. The types of special event are set out below: 



Event Name 


EN.CODE 


EN.OPERAND 


Function 


EVENT.RUN 


1 


Ignored 


Resumes execution from suspended state of 
the receiving CPU 


EVENTRESET 


3 


Ignored 


Generate a reset event on the receiving CPU 


EVENTSUSPEND 


5 


Ignored 


Suspends execution of the receiving CPU 


EVENTSET RESETHANDLER 


7 


boot address 


RESETHANDLER SHADOW ^ RESET 
HANDLER 

RESETHANDLER 4- boot address 



[0038] These special events may be sent from one CPU 12 or 13 to the other or altematively they may be sent 
through the debug port 30 from an external host to either of the CPU's 12 or 13 on chip. The "event" will be sent as a 
bit packet of the type previously described. 

[0039] In response to a special event, which acts as a CPU control packet, either CPU 12 or 13 can be made to 
cease fetching and issuing instructions and enter the suspended state. 

[0040] When an EVENTSUSPEND is received by a CPU it sets a suspend flag This flag is OR-ed with the state of 
the suspend pin to determine the execution stage of the CPU. 
[0041] The suspended state may bo entered by: 

40 . Asserting the SUSPEND PIN. This stops all CPUs on the chip. 

• Sending an EVENTSUSPEND to a CPU. This suspends only the receiving CPU. 
[0042] The suspended state may be exited by eKher of: 

45 

• Changing an external SUSPEND PIN from the asserted to negated stage. This causes all CPU(s) which do not 
have their suspend flags set to resume execution. 

• Sending an EVENT.RUN special event to a CPU. This clears the suspend flag. If the SUSPEND PIN is negated 
this causes the receiving CPU to resume execution. 

[0043] Entering the suspended state causes a CPU to drain the execution pipelines. This takes an implementation 
defined period of time. While a CPU is suspended its execution context may be changed in any of the following ways; 

• The reset address control register RESETHANDLER may be changed. 

• The CPU may be reset. 
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• E)ctemal memory may be changed by DMA, e,g using the debug link 30. 

[0044] At hard reset, (that is reset of all state on the chip) if the SUSPEND PIN is asserted at the active edge of the 
hard reset the CPU(s) state will be initialized but will not boot. The CPUs will boot from the addresses contained in the 
5 RESETHANDLER set prior to the reset event when they enter the running stale. 

[0045] The EVENT.RESET causes the receiving CPU to perform a soft reset. This type of reset causes the key 
internal state to be initialized to known values while saving the old values in dedicated shadow registers such as to 
enable debugging software to determine the state of the CPU when the reset took place. 

[0046] The instruction execution system for CPU 12 or 13 and its relation with the special event logic unit 44 wilt be 
TO described with reference to Figure 9. In normal operations the CPU fetch and execute instruction cycle is as follows. 
A prefetcher 101 retrieves instructions from the instruction cache 42 and the instructions are aligned and placed in a 
buffer ready for decoding by a decode unit 102. The decode unit 102 standardises the fornnat of instructions suitable 
for execution. A despatcher circuit 103 controls and decides which instructions are able to be executed and issues the 
instructbns akDng with any operands to the execution unit 104 or a load/store unit 105, The microcomputer chip of this 
15 embodiment has in addition the special event logic 44. This unit 44 can accept commands which originate from packets 
on the P-link system 15 through the interlace 23 so as to override the normal instruction fetch sequence. On receipt 
of an "event suspend' packet the special event logic 44 will cause the prefetcher 101 to cease fetching instructions 
and cause the despatcher 103 to cease despatching instructions. The execution pipeline of instructions is flushed. A 
"event run" packet will cause the special event logic 44 to cause the prefetcher to resume fetching instructions provided 
20 the suspend pin is not asserted. In addition to stopping or starting normal execution instruction, the special event logic 
44 can cause the "instruction stream" state to be reinitialized by a soft reset which is initiated by software when the 
chip is already running and resets only some of the state on the chip. Furthermore a packet can overwrite the register 
which holds the address on which code is fetched following a reset operation. 

[0047] The special event logic 44 will now be described in greater detail with reference to Figure 10. 

2£ [0048] Figure 10 shows the special event logic 44 connected through the link interface 23 to the P-link system 15. 
As is shown in more detail in Figure 10, the interface 23 is connected through a bus 110 to the special event logic 44 
which comprises in more detail the following components. An event handler circuit 111 which is connected by line 112 
to the instruction fetching circuitry 101 and by line 113 to the instruction despatcher 103. The bus 110 is also connected 
to event logic circuitry 114 which has a bi-directional communication along line 115 with the event handler circuit 111 . 

30 The event logic circuitry 114 is connected with a bi-directional connection to counter and alarm circuitry 116 as well as 
a suspend flag 117. A suspend pin 11B is connected to the event logic 114. A reset handler register 119 has a bi- 
directional communication with the event logic 114 along line 120. It is also connected to a shadow reset handler 
register 121. 

[0049] The operation of the circuitry of Figure 10 is as follows. An instruction may be executed on-chip or be derived 
35 from operation of circuitry on an external chip, which causes a packet to be transmitted on the P-link system 1 5 having 
a destination indicator identifying the module shown in Figure 10. In that case the packet is taken through the interface 
23 along bus 110 to the event handler 111 and event logic 115. The event logic determines whether the special event 
is "event run" or "event reset" or "event suspend' or 'event set reset handler". 

[0050] On receipt of an 'event suspend" the event logic 114 causes the suspend flag 11 7 to be set. The event logic 
'fo 1 14 forms a logical OR of the state of the suspend flag 1 1 7 and the state of the suspend pin 118. The result is referred 
to as the suspend state. If the arrival of the "event suspend" has not changed the suspend state then nothing further 
is done. If the arrival of the "event suspend" has changed the suspend slate then the event logic 114 inhibits the 
accessing of instructions from the cache 42. it does this by a signal to the event handler ill which controls fetching 
of instructions by the fetcher 101 and the despatch of instructions by the despatcher 103. Instructions fetched prior to 
^5 receipt of the "event suspend" will be completed but the CPU associated with the event logic 114 will eventually enter 
a state where no instructions are being fetched or executed. 

[0051] On receipt of an "event run" the event logic 114 causes the suspend flag 117 to be cleared. The event logic 
114 performs a logical OR of the state of the suspend flag 117 and the suspend pin 118. The result is known as the 
suspend state. If the arrival of the "event run" has not changed the suspend state then nothing further is done. If the 
^0 arrival of the "event run "has changed the suspend state then the event logic 11 4 ceases to inhibit access of instructions 
from the cache 42. A signal passed through the event handler 111 indicates to the fetcher 101 that the CPU should 
resume its fetch-execute cycle at the point at which it was suspended. 

[0052] In the event of receipt of an "event set reset handler" the event logic 114 causes the operand which accom- 
panies the special event in the packet, to be copied into the reset handler register 119 and the previous value that was 
55 held in register 11 9 is put into the shadow reset handler register 1 21 . 

[0053] On receipt of an "event reset" the event logic 114 causes the event handler 111 to cease its current thread of 
execution by providing a new instruction pointer on line 112 to the fetcher 101 and thereby start executing a new 
instruction sequence whose first instruction is fetched from the address given in the reset handler register 1 99. That 
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TO 



75 



20 



25 



30 



35 



40 



new address is obtained on line 120 through the event logic 114 to the event handler 111 prior to being supplied to the 
fetcher 101, 

[0054] It will therefore be seen that by use of the special events which may be indicated in a packet on the P-link 
system 15, sources on-chip or off-chip may be used to suspend the fetching and execution of instructions by a CPU 
or to resume execution of a suspended CPU. It may also be used to reset a CPU into an initial state or to provide a 
new boot code for the CPU from anywhere on the P-Hnk system or anywhere in an interconnected network using the 
external port 30 so that it forms part of the physical address space throughout the network which may be accessed by 
the CPU. 

[0055] More detailed Figures showing the special event logic 44 are provided in Figures 15, 16 and 17. Figure 15 
shows the P-link system 15 including a Receive buffer 140 and a Transmit buffer 141 adjacent the interface 23. When 
a packet including a special event is received in the buffer 140, inputs may be provided on lines 142, 143 and 144 to 
special event decode logic 145. When bit 15 of the event number is set to 1 thereby indicating a special event, a P 
valid signal is provided on line 142 to the decode logic 145. At the same time the event code field of the packet is 
supplied on line 143 to the decode logic 145 and the event operand field is supplied on line 144 to the decode logic 
145. In response to assertion of the P valid signal on line 142, the decode logic 145 decodes the event code field as 
indicated in the following table: 



P„en.code 


Signal asserted 


Ev„handle 


001 


Ev_run 




Oil 


Ev_reset 




101 


Ev_Susp 




111 


Ev_set 


P„en.op 



[0056] On the cycle of operations following decoding, the decode logic 145 outputs a signal on line 146 P Event 
done to clear the buffer 140. Depending on the result of decoding the signal on line 143, the decode logic may output 
either an Event Run signal on line 147 or an Event Suspend signal on line 148 to suspend logic 1 49 connected to the 
suspend pin by line 150. Altematively decoding of the signal on line 143 may cause the decode logic 145 to output an 
Event Reset signal on line 151 to the CPU pipeline circuitry 152. Alternatively the decode logic 145 may output an 
Event Set Reset Handier signal on line 153 to the reset handier logic 154 together with the operand value on bus 1 56. 
[0057] Figure 16 illustrates the suspend logic 1 49. Lines 147 and 1 48 form inputs to an SR latch 1 57 which provides 
a second input 158 to an OR gate 159 having the suspend pin providing the other input 150. In this way the signal on 
line 147 is logically or-ed with the suspend pin to generate a fetch disable signal on line 160 which includes a latch 
161 providing the suspend flag. The signal on tine 160 has the effect of inhibiting the fetching of instructions from the 
instruction cache 42. This eventually starves the CPU of instructions and the CPU execution will be suspended. As- 
sertion of the signal on line 148 will clear any previously asserted signal on line 147 in the normal operation of the SR 
latch 157. 

[0058] Figure 17 illustrates the reset handler logic 154. When the Event Set on line 153 is asserted, this is supplied 
to a reset handler state machine 162 connected to a register bus 163 interconnecting the reset handler register 11 9, 
shadow reset handler register 121 and the instruction pointer bus 112. The response to assertion of signal 153 is as 
follows: 



45 



1 The state machine 162 asserts the read line 164 of the reset handler register 119 which causes the value in the 
reset handler register to be read onto the register bus 1 63. 



2 The state machine 1 62 asserts the write line 165 of the shadow reset handler register 1 21 causing the value on 
the register bus to be written into the shadow reset handler register. 



50 



3 The state machine 162 causes the value on the Ev_handle bus 156 to be put onto the register bus. 



4 The state machine 162 asserts the write line 164 of the reset handler register 119 which causes the value on 
the register bus to be copied into the reset handler register 119. 



55 



[0059] Altematively if a getjptr_sig is asserted on line 1 66 from the CPU pipeline 1 52 then the following occurs. The 
state machine 162 asserts the read line {RAN) of the reset handler register which causes the value in the reset handler 
register to be read onto the register bus. This value is transferred along the line labelled IPTR. 
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[0060] The following method may be used to boot one or other of the CPUs 12 or 13 of Figure 1 v^/hen the chip is 
connected to an extemal microcomputer through the port 30 similar to the arrangement shown in Figure 11. The two 
CPUs 12 and 1 3 may be connected to a common suspend pin 118. When pin 1 18 is asserted, after the hard reset pin 
46 has been asserted, both CPUs are prevented from attempting to fetch instructions. The external link 30 and external 
5 microcomputer 123 can then be used to configure the minimal on-chip state by writing directly to control registers on 
chip 11 and storing the necessary boot code into the DRAM memory connected to bus 33 of chip 11 , When the state 
of the suspend pin is changed one of the CPUs can boot from the code now held in the DRAM for the chip 11 . To 
achieve this, the suspend pin 118 is changed to an assert state after a hard reset has been asserted. The external 
microcomputer 123 sends packets through the port 30 to write boot code into memory 120 shown in Figure 11 . The 
JO host 123 then executes an instruction to send the special event EVENT SET RESET HANDLER to the selected one 
of microcomputers 1 2 or 1 3 and in this example it will be assumed to be CPU 1 3. This will provide a new target address 
in the reset handler register 119 for CPU 1 3. The host 113 will then execute an instruction to send through the port 30 
a special event EVENT SUSPEND to the other CPU 1 2, This will set the suspend flag 1 1 7 of CPU 1 2. The assert signal 
on tho suspend pin 118 is then removed so that CPU 13 will start executing code derived from memory 120 from the 
'5 target boot address held in the reset handler register 119. CPU 12 will remain suspended due to the start of its suspend 
flag 117. When it is necessary to operate CPU 12. it can be started by CPU 13 executing an instruction to send to CPU 
12 the special instruction EVENT SET RESET HANDLER. This will change the default boot address held in the reset 
handler register 119 of the CPU 12. CPU 13 must then execute an instruction to send the special event EVENT RUN 
to CPU 12 which will, as described above, start execution of CPU 12 with code derived from the address in the reset 
20 handler register 119 of CPU 12. 

[0061] In this way the microcomputer of Figure 1 can be booted without the requirement of having valid code in a ROM. 
[0062] Although the above described boot procedure used boot code which had been loaded into the local memory 
120 for the chip 11 . the similar procedure may be followed using code located in the memory 125 which is local to the 
external microcomputer 123. To achieve this.^he same procedure, as above, is followed except that the special event 
^-5 which is sent through port 30 to load the reset handler register 11 9 of CPU 1 3 will provide a target address for the boot 
code which is located in the address space of the port 30. In this way when the assert signal is rennoved from the 
suspend pin 118, CPU 13 will start fetching code directly from the external computer and external memory When CPU 

12 is needed it can be started by CPU 1 3 as previously described. 

[0063] By arranging for the host 113 to send the special instruction EVENT SUSPEND to CPU 12 prior to removing 
30 the assert signal from suspend pin 118 it is possible to reduce the amount of instruction fetching through the port 30 
since CPU 1 3 may boot alone and then arrange for CPU 1 2 to boot rather than attempting to boot both CPUs 1 2 and 

13 from the extemal microcomputer through the port 30. 

[0064] Watchpoint registers may be used to monitor the execution of a program. These registers may be used to 
initiate a debug routine when a particular memory store is addressed or alternatively when instructions from a particular 
5,5 location are executed. 

[0065] Various examples of use of the chip 11 in a network having a plurality of interconnected chips are shown in 
Figures 11 to 14. 

[0066] In the example of Figure 11, the chip 11 is shown for simplicity with the single CPU 12 as CPU 13 is not 
involved in the operation described with reference to Figure 11. The chip is connected through the extemal memory 
^0 interface and bus 33 to a memory chip 1 20 which is local to the CPU 1 2 and forms part of the local address space of 
the CPU 12. The port 30 is connected by two serial wires 121 and 122 to a further microprocessor chip 123 which in 
this case forms a debugging host for use with chip 11 . Line 121 provides a unidirectional input path to chip 11 and line 
122 provides a unidirectional output path to the host 123. The host 123 is connected through a bus 124 to a memory 
chip 125 which is local to the host microcomputer 123 and thereby forms part of the local address space of the host 
^5 microcomputer 123. In order to carry out debugging operations on the CPU 12, the host microcomputer may operate 
software derived on-chip in the microcomputer 123 or from its local memory 125 so that the host 123 causes special 
events, as previously described, to be issued in packets along the serial line 121 through the port 30 onto the P-link 
^ system 15. These may have the destination address indicating the CPU 12 so that this special event is handled as 

already described with reference to Figure 10. This may be used to suspend the CPU 12 at any time and to replace 
^0 the value in its reset handler register and to reset the CPU 1 2 either from its previous state or from a new state indicated 
by the value in the register 119. The CPU 12 may have part of its address space located in addresses of the memory 
1 25 local to the host 123. The port 30 fonms part of the local address space for the CPU 1 2 and consequently a memory 
access may be made to the address space allocated to the port 30 and in this case the response may be synthesised 
by software running on the host microcomputer 123. It is therefore possible to set the reset handler register 119 to be 
^5 an address local to the host rather than local to the CPU 1 2. In this way a host can, independently of operation of the 
CPU 12, establish itself as the source of the instructions and/or data to be used by the CPU 12. This mechanism may 
be used to initiate debugging from the host 123, In the case of a chip 11 having tv/o CPUs 12 and 13. it is possible to 
debug software running on CPU 12 as already explained while leaving software running on CPU 13 unaffected by the 
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debug operation being carried out on CPU 12. This is the position shown in Figure 12 where the second CPU 13 is 
shown in broken lines and is operating normally in obtaining instructions from its instruction cache or from the memory 
120 quite independently of the debug routine operating on CPU 12 in conjunction with the host 123. 
[0067] Figure 13 shows an alternative arrangement in which the network is generally similar to that described with 

5 reference to Figures 11 and 12. However in this case the CPU 12 is provided with a data watchpoint register 130 and 
a code watchpoint register 131 in which respective addresses for data values or instruction locations may be held so 
as to initiate a debug routine if those watchpotnts are reached. In this example, the host microcomputer 123 can. at 
any point during the execution of a program by the CPU 12, briefly stop execution of the CPU 12 and cause the 
watchpoint slate in the registers 130 or 131 to be modified and return control to the original program of the CPU 12. 

^0 When the CPU 1 2 executes an instruction which triggers a watchpoint as set in either of the registers 1 30 or 1 31 , it 
stops fetching instructions in its normal sequence and starts fetching and executing instructions starting from the in- 
struction specified by the content of a debug handler register 1 32. If the debug handler register 1 32 contains an address 
which is local to the host 123 rather than local to.the CPU 12, the CPU 12 will start fetching instructions from the host 
123. In this way the host can establish the watchpoint debugging of a program which is already running without using 

^5 any of the memory local to the CPU 12 and without requiring the program of the CPU 12 to be designed in a manner 
cooperative to that of the debugging host 123. In this way the examples described provides for non-cooperative de- 
bugging. The operating system and application software for the CPUs on the chip 1 1 do not need to have any knowledge 
of how the debugging host computer 1 23 will operate or what operating system or software is incorporated in the host 
123. 

20 [0068] In conventional computer architectures watchpoint triggers are handled using a vector common to traps or 
events managed by the operating system. These traps and events use a conventional set of registers marked 134 
which provide the address of the handler routine. In the example described, an extra register set 1 35 is provided which 
includes the debug handler register 132 and a reset handler register 136. In this manner independence from the op- 
erating system is established by providing the extra register set 135 in which the address of the handler routine for 

25 watchpoint handling routines may be found. 

[0069] Figure 1 4 shows the same network as previously described with reference to Figure 1 2. In this case the host 
123 is provided and connected to the port 30 so that it may operate as previously described for use In debugging and 
the transmission of special events through the port 30, However in cases where it is necessary to monitor the debugging 
of one of the CPUs 12 or 13 as quickly as possible in debugging real time code, this example may be used to carry 

^0 out debugging of one of the CPUs 12 or 13 by use of the other of the CPUs 12 or 13 instead of the host 123. The 
transfer of packets along the P-link 15 on-chip may be performed faster than external communications through the 
port 30. In this case either of the CPUs 12 or 13 may execute instmctions which send special events to the other CPU 
on the same chip and thereby carry out a debugging operation as previously described with reference to use of the 
host 123 although in this case the control will be carried out by one of the on-chip CPUs in effecting a debugging 

35 operation of the other CPU on the same chip. 

[0070] It will be seen that in the above example the external host 123 can be used to carry out debugging of either 
of the on-chip CPUs 12 or 1 3 without restrictions on the operating systems or application software of either of the on- 
chip CPUs. The watchpoint debugging may be earned out without the need to use memory local to the on-chip CPUs. 
Both on-chip CPUs 11 and 12 and the host 123 which is externally connected have access to each other's state by 

40 packet communications through the port 30. The on-chip CPUs 12 and 1 3 can access the external memory 125 inde- 
pendently of any operation of a CPU in the host 123. This allows the on-chip CPUs to access code from a memory 
which is local to an externally connected microcomputer 

[0071] The external host may comprise a computer or a computer device such as a programmable logic array. 
[0072] As already explained with reference to Figures 1 and 1 8. the modules 1 4 as well as CPU's 1 2 and 1 3 include 
^5 circuitry which may generate request packets and receive response packets covering data transfers that are involved 
in memory accesses as well as normal event packets acting as interrupts and special event packets acting as obligatory 
control commands for a recipient device. Each of these packets may be distributed on the data and address paths 1 5 
which is common to all types of packet distributed on the same chip. The packets are distributed in parallel format on 
the path 15. The range of transactions covered by the packets are shown in the following table. 

so 





Request 


Ordinary Response 


Transaction 


Opcode 


Packet Length 


Opcode 


Packet Length 


LoadWord 


0x09 


8 


0x29 


11 


Load2 


OxOA 


7 


0x2A 


19 


Load3of4 


OxOB 


7 


0x28 


27 
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(continued) 



10 



IS 





Request 


Ordinary Response 




Opcode 


Packet Length 


Opcode 


Packet Length 


Load4 


OxOC 


7 


0x2C 


35 


StoreWord 


Oxil 


16 


0x31 


3 


StorQ2 


0x12 


23 


0x32 


3 


StorG3of4 


0x13 


31 


0x33 


3 


Store4 


0x14 


39 


0x34 


3 


Swap 


0x19 


15 


0x39 


11 


Event 


0x01 


16 


0x21 


3 



[0073] This table shows in the left hand column the type of transaction which may be designated by the packet and 
will be identified by the Opcode 74 of each packet. A different Opcode and different packet length is used for request 
and response packets as shown in the above table. All transactions listed in the table are memory access transactions 

20 apart from the event transaction. Although Figure 7 showed a general form of response packet it will be understood 
that various transactions including the event transaction do not require data in the response packet The response 
packet merely confirms to the source that the request packet was received. The memory access transactions listed 
above includes "LoadWord" which reads up to 8 bytes of data from a memory location. Load2 transaction reads 16 
bytes of data from a 16 byte aligned location in memory. Load3of4 transaction reads 24 bytes of data from a 32 byte 

25 aligned location in memory. Load4 transaction reads 32 bytes of data from a 32 byte aligned location in memory. The 
various "Store" transactions are the equivalent transactions for writing multiple bytes of data into memory. 
[0074] The event transaction may be a special event as already described. It may also be used to designate a normal 
event or interrupt. The request and response packets. for LoadWord, StoreWord and Event are shown in Figures 21 
to 24. Similar formats are used for the other transactions listed in the table above. Each includes a designation indicating 

30 byte for use by the P-router control 22 of Figure 1 to ensure that the correct device connected to the P-link 1 5 receives 
the packet which is put on the address and data path 15. Each packet includes the Opcode adjacent the destination 
indicator so that the recipient may decode the nature of the transaction required. Similarly each request packet has a 
TID indicator and source indicator as already indicated so that the recipient device may decode the packets according 
to a common format and provide response packets which also have a common format for decoding by the source of 

3S the request packet. 

[0075] In the case of the event transactions, the event request packet does not require the additional 3 bytes of 
address location provided In section 100 of Figure 6. Consequently the bit pattern used to identify the type of event 
(corresponding to the significant bits of the Event number referred to in connection with the event instruction) are 
located in section 100 used for additional address information in the other types of transaction. 

40 [0076] Each of the above transacting packets for use on the P-link 15 can be generated by a hardware operation 
such as the event logic in any of the modules or it may be generated in response to a software operation such as the 
execution of an instruction by the CPU. The format of the packets used on the P-link 1 5 is the same whether the packet 
is in response to a hardware operation or a software operation. The GPU 12 may execute an instruction such as an 
event instruction in order to directly generate a packet for use on the P-link 15. Alternatively it may execute an instruction 

45 which causes some other device to use hardware circuitry to generate a transaction packet of the type described above. 
In seeking instructions or data from the on-chip caches 42 or 43 it may be necessary to carry out a memory access 
operation in order to obtain data or instructions from memory which are not already in the respective cache. The cache 
control circuitry may then include circuitry similar to the event logic of the nnodules 1 4 so as to generate a memory 
access packet in response to the instruction execution by the CPU where the required instruction or data was not 

so already found in the cache. 

[0077] The instruction set of the CPU includes a plurality of load and store instructions corresponding to the various 
transactions listed in the table included earlier in the specification. The load and store instructions for memory accesses 
will generate a memory address using a base pointer with an index or offset. This will be used in the address portion 
of a request packet as previously described. The destination for such a packet will identify the interface of either an 

55 on-chip memory or an external memory. The type of instruction executed vvill determine the opcode of the packet. 

[0078] The transaction packets which are distributed on the on-chip P-link 15 may be output or input through the 
debug port 30 from an external chip in a network of the type shown in Figure 1 6. It will be understood that the transaction 
packet will then be changed from an on-chip parallel form to a serial bit form (or to a less serial bit form than that used 
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in the parallel format on the on-chip P-link 1 5). The serial bit packet will as previously described include a packet length 
indicator for use in the serial transmission on wires 10 between adjacent chips. In the arrangement shown in Figure 
18 the network comprises two or more similar chips each having its own system of address and data path 15 with 
attached modules 15 and CPU 12. In this way. the address space used on any chip in the network can be accessed 

5 directly by any other chip in the network so that each P-link connected in the network provides part of a commonly 
accessible address space. To distinguish between similar addresses on different chips, the P-router control 22 of each 
chip may include a plurality of look-up tables corresponding to the number of chips in the network. The P-router control 
may be reset to a look-up table providing addresses for its own on-chip addresses while the CPU may enable access 
to a different look-up table corresponding to a different chip in order to use a packet destination address which Identifies 

fo the correct destination on an interconnected chip accessible through the external port of the chip on which the source 
device is located. In this way the plurality of interconnected chips may use a common extended address space acces- 
sible by packets generated on any one chip with a destination address identifying a required destination on any inter- 
connected chip. 

[0079] Returning to the event packets, these will have a bit pattern extending over 2 bytes which identify the type of 
15 event as shown in the following table: 



Name 


Bits 


Function 


EN. VIP 


0-4 


Virtual Internjpt Pin to deliver event to. 


EN.TYPE 


5-6 


0 Edge event 

1 Level-on event 

2 Reserved 

3 Level-off event 


EN.CODE 


0-14 


Special event code 


EN. 

SPECIAL 


15 


Whether the event is a special or soft-reset event. If zero, the event is a normal event so 
EN. VIP and EN.TYPE are valid but eventoperand is not used. If one, the event is a 
special or soft-reset event, and ENXODE and the event.operand are valid. 




16-63 


Destination CPU/Device event address. 



[0080] This bit pattern will be located, as previously described, in the address section 100 of the packet shown in 
Figure 23. If the bit 15 is not zero then the event is a special Event as already described. If however bit 15 is zero then 
bits 0-6 identify the priority and type of a normal interrupt event to which the recipient may selectively respond. As 
indicated in the table above bits 0-4 indicate a virtual interrupt pin for use at the destination thereby indicating the 
priority of the normal event. Bits 5-6 indicate whether the Event is responsive to an edge detected event or a level 
detected Event. Signals 173 and 174 are supplied to virtual interrupt pin logic 175 which is shown in more detail in 
Figure 26. The incoming signals 173 and 174 are fed through a selector 176 to a register bank comprising virtual 
interrupt pins 0-31 . Each of these pins has a hard wired priority level corresponding to the number of the pins. In other 
words, pin 0 has the highest priority referred to as priority 0 and pin 31 has the lowest priority 31 . A register 178 holds 
a priority Indicator of the current thread being executed by the CPU, A priority encoder 1 79 checks the range of virtual 
interrupt pins 177 to locate which pins now have an indication of an awaiting event. This encoder 179 then provides a 
signal on line 180 to a comparator 181 which compares the priority of any arriving event with the current CPU priority 
indicated in register 178. In this way the VIP logic 175 is able to make a selective decision depending on the priority 
of the incoming event packet as to whether or not the CPU should at this time take action to respond to the Event 
packet or not. If the priority encoder 179 indicates that the incoming event has higher priority than that indicated in 
register 178 an event launch signal is provided on line 182 to the CPU pipeline 183 shown in Figure 25. The CPU 
pipeline 183 is provided with access to the instruction cache 42 and has a register file 184 which is used in the output 
of a transaction packet through a transmit buffer 185 connected to the P-link 15. The CPU pipeline 183 is also provided 
with a look-up table to provide an identification of instructions for an interrupt routine depending upon the source and 
device identifier of an event packet. If the VIP logic 175 determines that the CPU should respond to the Event packet, 
the Event decode logic 172 provides on line 186 details of the source and device ID of the event packet which is 
supplied on a control bus 187 to the CPU pipeline 183. This enables the CPU pipeline to identify the source of instruc- 
tions for the interrupt routine appropriate to that source and device. To enable the CPU to resume the interrupted thread 
after responding to the event, the CPU priority held in the register 178 is transferred into a save priority register 187 
so that at the end of the interrupt routine the original priority held in register 167 can be reestablished in register 178. 
Similarly the CPU pipeline 183 must both save the instruction pointer and thread status word appropriate to the inter- 
rupted thread for use in resumption of the thread after execution of the interrupt. These values are held in registers 
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188 and 189. 

[0081] In order to generate the event-packets in the event logic 8 of each module shown in Figures 1 and 1 8, event 
logic and packet generator circuitry of the type shown in Figure 27 Is used. A packet buffer 190 is arranged to output 
a packet onto the P-link 15 via interconnection 191 when the packet has been assembled. The event logic will include 

5 a signal level or edge detector 1 93 in order to initiate the generation of an event packet. The particular device or module 
in which the event logic is located will store in a register 1 94 details of the source and a device identifier for the particular 
device at the source which is giving rise to the event packet. The module will also include In register 195 details of the 
destination address for any event packet generated by that module. Register 196 will hold details of the event trans- 
action which will be requested by that device. It will have the bit pattern necessary to decide whether the event is a 
special event or a normal event and in the event of a normal event it will provide the event priority and event type. If 
that module is providing a special event, then an operand may be needed and this will be held in an operand register 
1 97 for concatenation in an operand section of the packet buffer 1 90. If the device or module is also required to produce 
a memory access transaction packet then buffer 202 will operate under a control unit 1 98 to locate In the packet buffer 
1 90 an indication of the memory access transaction rather than an event transaction. The device may well output more 

^5 than one request packet prior to receiving any response packet. For this reason a counter 1 99 is provided to count the 
number of packets output and it also receives an input of the number of response packets received by buffer 200. The 
connection of event transaction data or memory access transaction data into the packet buffer 191 is controlled by a 
selector 201 controlled by the control unit 198. The contents of the registers and function of the control unit 198 will 
be determined by the particular implementation of the transaction packet generating circuitry at any particular device 

20 or module. 

[0082] The invention is not limited to the details of the foregoing example. 



Claims 

25 

1. A computer system including an integrated circuit chip with an address and data path interconnecting a plurality 
of on-chip devices Including at least one CPU, at least one module and a memory interface, (a) said module having 
packet generating circuitry responsive to an event to generate an event request packet including a destination 
address, (b) said CPU having event logic to decode the packet and identify the request of the packet, and circuitry 

30 to generate addressed memory access packets, and (c) said address and data path being used for distribution of 

both event request packets and memory access packets. 

2. A computer system according to claim 1 in which both said module and said CPU each include packet generating 
circuitry operable to generate both event request packets and memory access packets for distribution on the com- 

35 mon address and data path. 

3. A computer system according to claim 1 or claim 2 in which the packet generating circuitry is responsive to receipt 
of an event request packet to generate an addressed response bit packet for distribution on said address and data 
path. 

40 

4. A computer system according to any one of the preceding claims in which the packet generating circuitry of a 
module includes means to indicate the address of the destination for the packet as well as the address of the 
module acting as a source of the packet. 

^5 5. A computer system according to claim 4 in which the packet generating circuitry is responsive to receipt of an 
event request packet to determine from the packet a source address of the packet and to generate a response 
packet using said source address as the destination indicator for the response packet. 

6. A computer system according to any one of the preceding claims including an on-chip memory, said memory 
50 interface providing connection between said address and data path and said on-chip memory. 

7. A computer system according to any one of the preceding claims including an off -chip memory, said chip having 
an external memory interface connected to said off -chip memory and to said address and data path. 

55 8. A computer system according to any one of the preceding claims in which the packet generating circuitry of said 
module is arranged to generate an event request packet forming an interrupt request with a priority indicator and 
said event logic of the CPU includes comparator circuitry for comparing priorities of event request packets received 
with the priority of any current CPU activity 
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9. A computer system according to any one of claims 1 to 7 in which the packet generating circuitry of said module 
is arranged to generate an event request packet in the form of a control packet for control command to the CPU. 

10. A computer system according to any one of the preceding claims in which a plurality of modules are provided on 
chip, each having packet generating circuitry for generating event request packets, at least one module being 
arranged to generate event packets in the form of prioritised interrupt requests and at least another module being 
arranged to generate event request packets in the form of control packets for the CPU. 

11. A computer system according to any one of the preceding claims in which said address and data path includes at 
least one on-chip bus arranged to distribute said packets in bit parallel format. 

12. A computer system according to claim 10 wherein said integrated circuit chip includes at least one external port 
for off-chip connection, said port including bit format translation circuitry to convert on-chip packets of bit parallel 
format to a less parallel format for transmission off-chip. 

13. A computer system comprising at least two integrated circuit chips, each as claimed in any one of the preceding 
claims, the two chips being connected through external ports to allow communication between the address and 
data paths of the interconnected chips. 

14. A method of operating a computer system comprising an integrated circuit chip with an address and data path 
interconnecting a plurality of on-chip devices including at least one CPU, at least one module and a memory 
interface, which method comprises detecting an event at a module, generating an event request packet with a 
destination address, distributing the request packet on the address and data path to the destination, decoding the 
packet at the destination to identify the nature of the request, said method further including generating addressed 
memory access packets for memory read and write operations, said memory access packets and said event re- 
quest packets being distributed on the same address and data path. 

15. A method according to claim 14 wherein event request packets are generated for distribution on the address and 
data path to the CPU, which event request packets are in the form of prioritised interrupt requests. 

16. A method according to claim 14 or claim 15 in which event request packets are generated for distribution on said 
address and data path to the CPU, at least some of said event request packets being in the form of control command 
packets to which the CPU must respond on receipt of the packet. 
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Fig.25. 
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